UPPER TAILS FOR COUNTING OBJECTS IN 
RANDOMLY INDUCED SUBHYPERGRAPHS AND 
ROOTED RANDOM GRAPHS 
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t Abstract. General upper tail estimates are given for counting edges 

in a random induced subhypergraph of a fixed hypergraph Tt, with an 
easy proof by estimating the moments. As an application we consider the 
numbers of arithmetic progressions and Schur triples in random subsets 
of integers. In the second part of the paper we return to the subgraph 
counts in random graphs and provide upper tail estimates in the rooted 
Ph case. 
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1. Introduction 



Consider a finite sum of dependent random variables of the following form. 
Let T be a finite ground set and let S be a family of its subsets. Let T p be a 
J> ■ random, binomial subset of T which independently includes each element of 

CN \ r with probability p. Finally, for each S € S, let Is be the indicator random 

variable of the event {S C T p }. Then X = X(T,S,p) = YlseS^-S counts 

■ the number of members of the family S contained in a random subset T p . A 
lot of research has been devoted to the study of the asymptotic distribution 

■ of X when the order N = \T\ grows to oo and p = p(N), both in a general 
0^ , setting and for particular instances, most notably for random graphs, see 

One feature which received a lot of attention is the rate of decay of the 
tails of X, the lower tail ¥(X < tEX) for < t < 1, and the upper tail 
$_i ■ ¥(X > tE,X) for t > 1. Good estimates for the lower tail follow from 

the FKG inequality (lower bound) and Janson's inequality (upper bound), 
see [7], Section 2.2. Often, these two bounds asymptotically match under 
some restrictions on the dependencies among the summands Is- This is, in 
particular, the case of subgraph counts in random graphs, see [7], Section 
3.1. 

The upper tails tend to be harder to analyze. Some ad hoc results can 
be found in [7J, [TO], [S], [6], among others. For the subgraph count problem 
a quite satisfactory and complete result has been obtained in [4], where 
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the logarithms of the upper and lower bound on ¥(X > tEX) are of the 
same order of magnitude except for a logarithmic term. A generalization to 
random hyper graphs can be found in pQ . 

This paper can be viewed as a follow-up paper to [4j- Using the proof 
techniques developed therein, those results are extended in two directions. 
First, we return to the more general model of set systems (or hypergraphs) 
and obtain some straightforward estimates for the upper tail of X, covering, 
in particular, the number of arithmetic progressions of given length in a 
random subset of integers. Then, we return to the subgraph counts to study 
the rooted version of the problem, only to discover some unexpected features 
there. 



2. Counting edges of randomly induced subhypergraphs 

Let TL be a k- uniform hypergraph on a vertex set V with |T| = N and 
with \TL\ = aN q edges, where a = a(N) > and < q < k. Consider a 
random, binomial subset T p of T, where < p = p(N) < 1, and the random 
variable X = \TC[T p ]\ counting the edges of H that are entirely present in 
T p . Note that 

p:=EX = \H\p k = aN q p k . 

For j = 0, 1, . . . , k, let 

A,- = max \{T € H : T 5 S}\, 

i.e., the maximum number of edges that contain j given vertices. 

Theorem 2.1. Let q be an integer, 1 < q < k, and let ao > and t > 1 be 
real numbers. There exists a constant c = c(q,ao,t) such that ifH satisfies 
the following four conditions: 

(i) a(N) = \H\/N* > a , 

(ii) for allj < q we have Aj = 0{N<i~3), 
(hi) for all j > q we have Aj = 0(1), 

(iv) there exists C > and Tq C T such that \Tq\ < Cp}l q and \7i\Po]\ > 
tix, 

then, with X = |H[rJ|, 

p c ^ /q = exp{-C/i 1/9 log(l/p)} < P(X > tp) < exp{-cp 1/q }. 

Before giving the proof, we make some comments. 

• The two exponents are of the same order of magnitude except for 
the logarithmic term log(l/p); this inaccuracy disappears obviously 
for p constant. 

• Note that P(X > tp,) > tp < \H\ tp k < 1, so the 
theorem is interesting for t < p~ k only. (For larger t, P(X > tp) = 
so the lower bound fails, while the upper bound is trivial; further, 
(iv) fails.) 
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• Condition (iii) is redundant, since it follows from (ii) with j = q, but 
we prefer to include it explicitly for emphasis, and for comparison 
with Theorem 12.21 which allows non-integer values of q (note that for 
non-integer q, (iii) does not follow from (ii)). 

• As we will see in the proof, the upper bound follows only from con- 
ditions (i) - (iii) , while the lower bound is a consequence of condition 



(iv) alone. 



Proof. Take C and Tq as in assumption (iv) We have 



\x > tn) > P(r p d r c 



which proves the lower bound. 

For the upper bound, we use the same approach as in [3]. By Markov's 
inequality, for every m we have 

IE X m 

¥(X > tri) < . 

It remains to show that for a sufficiently small c\ = ci(q,ao,t) and m = 
[ci^l we have, say, EX m < t m / 2 fi m . 

Having chosen m — 1 (not necessarily distinct) edges Ex, . . . , E m -\ of H, 

j, and let 
0, 

(2.1) 



let Nj be the number of edges E m such that \E m n UIHi 1 



JV- 



E 



k>j Nk- We estimate these numbers as follows: For j 
No < N >0 = \H\. 



For 1 < j < q, by (ii; 



N j < N >j = 0(m j Aj) = 0(m j N q - j ), 
n Ui^i 1 -^*! — i) then there exists a set A C \J 



(2.2) 



m 1 1 Ei with 



since if \E m 

\A\ = j and E m D A, and there are O^ 3 ) such sets A, and at most Aj 
edges i? m for each A. For j > q we obtain 

< iV> g = 0(m 9 ) (2.3) 

from (with j = g). 

Arguing as in [3] we have from (|2,ip - (|2,3p . by induction on m, 

f q k 

EI m < J \H\p k + ^20(m^N q ^)p k -^ + 0(m g )p k -i 
V i=i i=9+l 



m— 1 




TO— 1 



^ m 1 + 0(o" 



for every m > 1. Now choose m = [ci/i 1 / 9 ] > 1, as said above. If m > 2, 
then m/(Np) < 2dfi 1/q /{Np) = 2c 1 a 1 / q p k / q - 1 < 2aa 1/q , and thus, using 



(i) , the term in parenthesis in the last line can be made arbitrarily close to 



1 for all m > 2 by choosing c\ > small enough; in particular, it can be 
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made less than t 1 / 2 . Hence, for the chosen m, EI™ < i" 1 / 2 //™ if m > 2, 
and trivially if m = 1 too. This completes the proof. □ 

In the case of non-integer q, the upper bound gets further away from the 
lower bound. Indeed, we then have the following result. 

Theorem 2.2. Let q, ao and t be real numbers, with < q < k, ao > 
and t > 1. There exists a constant c = c(q,ao,t) such that under the same 



assumptions (i) - (iv) as in Theorem \2.1\ 

F(X > tfi) < expj-cmax^/V^ 1719 ^ 1 ^,/" 17 ^ 1 )} 

and 

P(X > tfi) > p c » 1/q = exp{-<V /<? log(l/p)}. 

Proof. The only difference in the proof is when we bound Nj to estimate 
EX m . Namely, for j > \q], we either use Nj < N>^ = 0{wX q ^ N q ~^), or 
Nj < iV>r^ =0(ra^). We then choose 

m = [cimax^/CVLgJ-V^^i/rgl)] 

for a small constant c±. (We may assume /x > 1, since otherwise m = 1 and, 
recalling that t > 1, the estimate EX < i 1//2 // is trivial.) □ 

2.1. Integer solutions of linear homogeneous systems. For an I x k 

integer matrix A, where I < k, assume that every I x I submatrix B of A 
has full rank r{B) = I = r(A). Consider the system of homogeneous linear 
equations Ax = 0, where x = (xi, . . . ,Xk) is a column vector and is a 
column vector of dimension I. We assume also that there exists a distinct- 
valued positive integer solution of Ax = 0. These assumptions seem to be 
quite restrictive, but, in fact, we cover at least one important case: the 
arithmetic progressions of length k which can be viewed as distinct-valued 
solutions to a system of / = k — 2 equations. 

Let T = [N] := {1, 2, . . . , N} and < p = p(N) < 1. Then T p is a random 
subset of the first N integers with density p. Define a fc-uniform hypergraph 
TLa = ~Ha{N) as the family of all solution sets {x±, . . . ,Xk} of the system 
Ax = with Xi distinct and in [N]. Let us check that for some ao,q, and 



C the assumptions (i) - (iv) of Theorem 12.11 hold, at least in the interesting 
case (j, = \H A \p k > 1 and tp, < \H A \, which can be equivalently restated as 

fjt > 1 and t< p~ k . (2.4) 

Set q = k — I. 



(i) , (iv) : We will show that there exists ao > such that for sufficiently 
large m < N we have 

\n A (m)\ > a m q . (2.5) 



Taking m = N in (|2.5p we obtain \TCa\ > a oN q , which is (i) Taking 
m = min([~(tag l p) l / qn \ , iV) in (|2.5p and Tq = [m] we obtain (iv) with C = 
2(ta ~ 1 ) 1//|J , using the assumptions in (|2.4j) . 
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Let xo £ Z k be a positive integer solution of Ax = 0. Let Mq by the 
largest of its coefficients xq\, . . . , xofc- Let Xi, . . . , x g be q linearly indepen- 
dent integer solutions of Ax = 0. (There exist q linearly independent rational 
solutions, and we may multiply these by their common denominators and 
thus assume that they are integer solutions.) Let M be the maximum of the 
absolute values of the coefficients in x%, . . . , x q . 

Given m, let d := [_m/(Mo + 1)J. For any integers aq, . . . ,a q , the sum 
^ x o + Y^i=i a « x i yields an integer solution of Ax = 0, and these solutions 
are all distinct. If further |aj| < d/(2qM) for all i, this solution has all 
coefficients positive, less than m, and distinct. The number of these solutions 
is 9((£?) = 9(m«). Hence, (gSJ holds. 



(ii) , (iii) : By elementary algebraic properties of systems of linear equa- 
tions, every system By = c, where B is an integer I x h matrix, has no more 
than N h ~ r ^ solutions in [N]. Thus, A = \H A \ < N k ~ l = N q . For every 
subset J of the columns of A, define Aj as the sub matrix obtained from A 
by removing the columns in J. This means that when we fix values of some 
j variables, then the obtained system of equations is of the form By = c, 
where y consists of the remaining unknowns, B = Aj, and J is the set of 
columns of A corresponding to the fixed variables. Hence, the number of 
solutions with j given elements corresponding to the given columns J is at 
most N k ~^~ r ^ J ' . Now, for all j < q = k — I, if \ J\ = j then, by our assump- 
tion on A, r(Aj) = I, so (summing over J) Aj = 0(N k ~i~ l ) = 0(N q ~ l ). 
On the other hand, if j > k-l then r(Aj) = k-j, so Aj = O(N ) = O(l). 

Hence, given (|2.4p . Theorem 12.11 applies for such TIa with q = k — I and 
M = Q(N k - l p k ). 

Example 2.3. In particular, we obtain quite sharp estimates for the tails 
of the numbers of arithmetic progressions of length k in [N] p . Indeed, they 
are given by the system Xi — 2xi + \ + xi + 2 = 0, i = 1, . . . , k — 2. It is easy 
to check that for I = k — 2 every I x I submatrix has full rank, and we have 
the following result. 

Corollary 2.4. Let X be the number of arithmetic progressions of length 
k in [N] p , k > 3, and let p, t > 1, and p satisfy (|2.4f) . Then there exist 
c, C > such that 

p CN P */ 2 = exp {_ C Np k / 2 log(l/p)} < F(X > tfi) < exp{-cNp k / 2 }. □ 

Example 2.5. A Schur triple is a triple {x, y, z} of positive integers such 
that x + y = z, x ^ y. In this case we have k = 3, I = 1 and so, q = 2. 

Corollary 2.6. Let X be the number of Schur triples in [N] p , and let n, 
t > 1, and p satisfy (I2.4p with k = 3. Then there exist c, C > such that 



P 



CNp 3 / 



2 



exp{-CA^p 3/2 log(l/p)} < F(X > tfi) < exp{-cNp 3 / 2 }. □ 



Remark 2.7. Arithmetic progressions are partition regular, a name intro- 
duced by Rado for all linear systems the solutions of which satisfy theorems 
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similar to the van der Waerden theorem. But, in addition, they are also 
density regular, which means that every subset of integers of positive den- 
sity contains them (Szemeredi's theorem). Partition properties of random 
subsets of integers with respect to density regular systems were studied in 
[9j. Schur triples form an example of partition regular but not density reg- 
ular linear system. Partition properties of random subsets of integers with 
respect to Schur triples were studied in [3]. 

Remark 2.8. We have here treated the set of solutions x to Ax = as 
a hypergraph, i.e., we have treated the solutions x as fe-sets rather than 
A;-vectors. This is fine for the examples of arithmetic progressions and Schur 
triples treated above, but in general it may be more natural to regard the 
solutions x as vectors (or, equivalently, sequences) in [N] k , rather than as 
sets. We then define Ha as the subset {x : Ax = 0} of [N] k . In this 
way, we distinguish between solutions that are permutations of each other 
(for example, (x,y,z) and (y,x,z) in the Schur triple case), and we allow 
repeated values. 

It is possible to prove a version of Theorem 12.11 for this case, using essen- 
tially the same proof, but the possibility of repeated elements of T = [N] 
complicates the conditions; we now need bounds on the number of vectors 
in Ha that have j coordinates fixed, and at most I distinct values of the 
other coordinates. We omit the details. 

2.2. Further examples and remarks. 



Example 2.9. In the dense case, that is, when q = k, assumption (iv) 
holds trivially by averaging over all subsets Tq of a suitable size, provided 
the necessary condition t < p~ k is satisfied, but this result has been known 
already (cf. [5] and [6]). In particular, this case covers the number of match- 
ings of size A; in a random r-uniform hypergraph G^ r \n,p), by considering 

a A;-uniform hypergraph H where the vertices are the edges of the complete 

(r) (r) 
r-uniform hypergraph Kn and the edges are the matchings of size k in K n . 

Then the assumptions of Theorem 12.11 hold with q = k. 

Remark 2.10. It can be very hard to improve upon Theorem 12.21 because 
it contains the triangle count problem from Indeed, with T = (^) an d 
H being the family of the edge sets of all triangles in K n , we have N = (g) 
and \H\ = 6(n 3 ) = 6(iV 3 / 2 ), so q = 3/2. To get the result from @], we 
would need to improve the upper bound, but this seems to be impossible 
without "seeing" the vertices of the random graph. 

3. Rooted subgraphs of random graphs 

A rooted graph (R, G) is a graph G with a fixed independent set R; we 
also say that the graph is rooted at R. (For simplicity, we sometimes use 
G to denote the rooted graph (R, G) when R is clear from the context.) 
Counting rooted subgraphs of a random graph G(n,p) with a fixed set R of 
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roots plays an important role in studying the so called extension statements 
and 0-1 laws in random graphs, see, e.g., Sections 3.4 and 10.2]. Another 
application can be found in [8], where a sharp concentration of the number 
of paths of given length connecting two given vertices is utilized. Here we 
give a quite accurate estimate of the upper tail of the number of rooted 
copies of a given rooted graph in G(n,p); the result is similar to our main 
result in [4] for unrooted graphs, but somewhat simpler, except for a new 
complication for constant p. 

A rooted graph (R' , H) is a rooted subgraph of (it!, G) if H is a subgraph 
of G and R' = V(H) n R. We let N R (G, H) denote the number of rooted 
copies of H in G. 

Given a rooted graph (R, G) and a graph F on the vertex set V(F) = 
[n] = {1,2,..., n}, let r = \R\ and regard F as rooted on [r] = {1, . . . , r}; 
we say that a rooted subgraph of ([r], F) isomorphic to (R, G) is an R-rooted 
copy of G in F. Thus N R (F, G) is the number of -R-rooted copies of G in F. 
In particular, when F is a random graph G(n,p), we let the random variable 
X = X R = X R (n,p) be the number N R (G(n,p),G) of -R-rooted copies of 
G in G(n,p). We further define 

/x = fi R (G, n,p) := KX R = N R (K n , G)p e ^ . (3.1) 

For a subgraph H of G let H — R be the graph obtained from -ff by 
deleting all vertices of R (together with incident edges), and define 

m R = V%(n,p) := n< H ~ R ^p< H \ (3.2) 

Note that = 0(EXjf ), with = fln V(.ff), but as defined, it does not 
depend on the actual set R' of roots of H. 

Recall that, for a graph H, the fractional independence number a*(H) is 
defined as the maximum value of Y^i x i over an assignments (xi)i e v(H) such 
that < Xi < 1 for all vertices i € V(i?) and Xi + Xj < 1 for every edge 
ij G iT. We let 

M R , G = M R>G (n,p) = min ^ R ) 1/a * {H ~ R) . (3.3) 



We further let 



m fl (G) := max > 0, (3.4) 

v 7 HCG,e(H)>0v(H - R) v ' 



and note that (pTBj) . (|33j) and (j3T4j) imply that 

Mr jG < 1 ^ np m « (G) < 1. (3.5) 

By the same argument as for the unrooted case in [7J Section 3.1], it is 
easy to show that p = n _1 / mi?( ' G '* ) is the threshold for the appearance of an 
-R-rooted copy of G in G(n,p). 

Let e R {G) = e(G) — e(G — R) be the number of edges in G incident with 
the root set R. We assume below that e R (G) > 0; the case e R {G) = 
is uninteresting since then X R equals the number of copies of the unrooted 
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graph G— R in G(n,p) — [r], which we identify with G(n—r,p), so Xq(ti,p) = 
Xg-r(ti — r,p) and we may apply the results of [4]. 

Theorem 3.1. For every rooted graph (R,G) with en(G) > and for every 
t > 1 there exist constants c = c(t,G) and C = C(t,G) such that for all 
n > v(G), with p 1 := f-Ve«(G) an d P2 ■= t~ l K G h 

(a) Ifp< n-V^«(<3) ; then 

p c = exp{-Clog(l/p)} < F(X§ > tn) < exp{-c}. 

(b) Ifn' 1 /™^ <p< pi, then 

p CM R , G = eX p{_ CMi?iG log(l/p)} < F(X% > tfi) < exp{-cM R>G }. 

(c) If pi < p < pi, then 

exp{-C(n + (p - pifn 2 )} < F(X§ > tji) < exp{-c(n + (p - pifn 2 )}. 

(d) If P2 < P < 1, iften 

P(X^ > t^x) = 0. 

Note that < p\ < P2 < 1, and that pi and j>2 do not depend on n. 
Before giving the proof, we make some comments. 

(i) Case (d) is trivial, because p > P2 < ^=> tp e ^ > 1 <^=^> tfi > 
N R (K n ,G), see (|3.ip . so it is impossible to get at least tfi rooted 
copies of G on n vertices. 

(ii) Case (a) is uninteresting and included only to show that the esti- 
mates in (b) extend in a continuous way to smaller p. (Note that 
M R ^ G = 1 at the threshold p = n^ 1 ^" 1 ^, cf. (f33j) .) Indeed, in case 
(a) we are below the threshold, so typically Xq = 0. 

(hi) If eji(G) = e(G), or equivalently e(G — R) = 0, i.e., all edges in G 
have a root as one endpoint, then p\ = p2 and case (c) disappears, 
so that (b) is valid until the cutoff at p2- For all other G, p\ < p2 
and case (c) appears, so there is a phase transition at p\. 

(iv) In the unrooted case in [4] there is also a phase transition at p = 
n -i/AG_ This has no counterpart in the rooted case. 

(v) Since en(G) > 0, G has a rooted subgraph Hq which is just a single 
edge with one endpoint in R; we have = np and o*{Hq — R) = 
a*(Ki) = 1, so 

M R , G <{yX ) 1/a * iH °- R) =np<n. (3.6) 

Hence, the upper bound in (b) is never stronger than exp{— B(n)}. 

(vi) In (b) the exponents in the lower and upper bound are of the same 
order of magnitude except for the logarithmic term log(l/p); this 
inaccuracy disappears obviously for p constant. 

(vii) For any fixed p > (or p = p(n) G [po, 1] for some constant po > 0), 
tfg = Q(n v( - H - R ^). Since a*(H-R) <v{H-R) for all H C G, with 
equality for at least one H with e(H) > 0, viz. a single rooted edge, 
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(|3.3p shows that then Mrg = ®(n). Consequently, the result in (b) 
can be written for constant p < p\ as ¥(X R > tfi) = exp{— 0(n)}. 
This shows that the bounds in (b) and (c) agree at p = p\. Moreover, 
we obtain the following corollary. 

Corollary 3.2. With assumptions and notations as in Theorem \3.1[ assume 
further that p is fixed. 

(a) If < p < p\, then 

P(X§ > tn) = exp{-6(n)}. 

(b) If pi < p < p2, then 

P(X§ > tn) = exp{-6(n 2 )}. 

(c) If p2 < p < 1, then 

F(X§ > tpt) = 0. 

The sudden jump in the exponent from n to n 2 at p = p\ (for G with 
e(G — R) > 0, so p\ < P2) may be surprising, and has no counterpart in 
the unrooted case in [4j. It may roughly be explained as follows (see the 
proof): If p < pi, then it suffices (typically) to have all 0(n) edges from 
the roots present in G(n,p) in order to have more than tfi rooted copies 
of G. However, if p > pi, this is not enough, and we need also (typically) 
a larger proportion than p of the ( n r r ) other possible edges, which by the 
usual Chernoff bound has probability only exp{— 0(n 2 )}. 

Proof of Theorem I3.il We mostly follow closely the proof for the unrooted 
case from [3], and therefore omit some details. As remarked above, (d) is 
trivial. Part (a) can be proved by a modification of the argument below, 
replacing Mr : q by 1; we omit the details and refer to the corresponding 
argument in [3] . Hence we consider only (b) and (c) . We let C\ , C2 ■ ■ ■ and 
ci, C2, • • • denote constants that may depend on G and t, but not on n or p. 

Upper bounds: If (R,H) is a rooted graph, let N R (n,m, H) be the max- 
imum of N R (F,H) over all rooted graphs F with v(F) < n and e{F) < m 
and with a set of roots of size \R\. In other words, N R (n, m, H) is the max- 
imum number of copies of (R, H) that can be packed in n vertices and m 
edges with a given set of \R\ roots. 

Let us start with the observation that if the minimum degree 5(H) > 
then 

N R (n, m, H) < N R (2m, m, H) = 0(N(2m, m,H - R)). (3.7) 
Indeed, for any F with v(F) < n, e(F) < m, and 6(F) > 0, we have 
v(F) < 2m, so the left hand side inequality follows. To prove the right hand 
side inequality, assume that F and H have the same set of roots R. Then 

N R (F, H) < N(F — R, H — R) x 2 mv{H) ~ W) = 0(N(2m, m,H - R)). 

Now, to prove the upper bound on F(X R > tfi), as before, we want to 
show that, say, EX m < t m / 2 ^ m , where X = X§, p = E X, and m is suitably 
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large. Similarly as in [3] and, as a matter of fact, similarly to the proof of 
Theorem 12.11 here, an inductive argument yields, for all m > 1, 

/ N R ' (n, (m - l)e(G), H) \ m ~ l 

^ HCG H ' 

where the sum extends over all rooted subgraphs (R' , H) of (R,G) with 
5(H) > 0. (H corresponds to the subgraph spanned by the edges in the 
intersection of the mth copy of G and the union of the m — 1 previous 
copies, and as such has 5(H) > 0.) 

We take m := \c\Mr i q\ for a suitable small constant c\ € (0, 1) to be 
fixed later. By (|3.7p . [4, Theorem 1.3] and (|3.3p . for every HCG with 
5(H) > 0, assuming m > 2, 

N R ' (n, (m - l)e(G),H) < C 2 N(2(m - l)e(G), (m - l)e(G),H - R) 

= Q(m a ' {H - R) ) = e((ciM /?iG ) a *^- /? )) 

< C 3 ci*£ . 

Hence, yields (the case m = 1 being trivial), EX m < ^(l+C^)"* -1 . 
We choose c\ so small that 1 + C^c\ < t 1 / 2 , and then Markov's inequality 
yields 

P(X > i/x) < ^ < t- m / 2 < exp{-o 2 M fl)G }. (3.9) 

In particular, this yields the upper bound in (b). 

For the upper bound in (c), we note that each rooted copy of G in K n 
yields a copy of G — R in K n — R = K n _ r ; conversely each copy of G — R 
in K n — R can be extended to exactly g rooted copies of G in K n , for some 
integer g > 1 depending on G. Hence, X R (n,p) < gXG-R(n — r,p). Further, 
N R (K n , G) = gN(K n _ r , G - R) so 

fi = N R (K n , G)p< G) = gN(K n - r ,G - R) p <G-R)+e R (G) 

= gfx(G - R,n-r,p)p eR( - G) . (3.10) 
Consequently, 

¥(X§ > tti) < P(gX G ^ R (n -r,p)> tg^(G -R,n- r,p)p e *^) 

= P(X G _ R (n-r,p) >tp eR( - G) n(G-R,n-r,p)). (3.11) 

Let t:= tp eR ( G \ and note that, for (c), 1 < t < t. By (U Theorems 1.2 and 
1.5, and Remark 8.2], recalling that t is fixed and p > p±, 

P(X G - R (n - r,p) > tfi(G - R,n - r,p)) < exp{-c 3 (t - l) 2 n 2 }. (3.12) 

Further, 

1-1 = tp eR{ - G) - 1 = (p/pi) eR{G) - 1 > p/pi -l>p-pi, 
so l|5H]> -l|515 jl yield 

F(X R > tfj) < exp{-c 3 (p - pifn 2 }. (3.13) 
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The upper bound in (c) now follows by taking the geometric mean of (|3.9p 



and (13. 13[) . noting that in this range of p, Mr^g = &(n) as remarked in (vii^ 
above. 

Lower bounds: Let H be a subgraph of G such that e(H) > and 

M:=M R , G =(^) 1/a * {H - R} . 

Since we consider parts (b) and (c) only, M > 1 by (|3.5p . 

Set po = (Svct)^ 1 and assume first that p < p$. (Note that po < t^ 1 < 
p\.) We construct, as in [4], a graph F with 

v{F) < 3{v G - r)tM, e(F) = O(M), and N(F, H — R) > 2M§. 

(3.14) 

This is done as follows. Let {xi)i£V(H-R) De an optimal assignment for the 
fractional independence problem, that is, < x^ < 1, xi + Xj < 1 for every 
edge ij € H — R, and Ya x i = ot*{H — R). Construct F by blowing up 
each vertex of H — R to a set of [^M 21 *] vertices and replacing each edge of 
H — R by the complete bipartite graph. This yields (13. 14[) . where we have 
put 3 rather than 2 because of the ceiling. Now, by (|3.6p . 



v (F) < 3(vg — r)tM < 3(vg — r)tnp < (1 — r/vG)n < n — r. 

We may thus fix a copy F± of F with V{F\) C [n] \ [r]; we further let Fi 
be F\ enlarged by adding all roots 1, . . . , r together with all rv{F\) = 0{M) 
edges between the roots and V{F\). Now, exactly as in [4], it follows from 
[H Lemma 3.3] that 

HXg > tfl) > i/( G ) F(G(n,p) D F 2 ) = lf(G)+e{F 2 ) = pQ (M)_ 

This proves the lower bound in (b) when p < po- 

Assume now that po < p < P2 and note that the lower bound we want to 
prove can be written as exp{— 0(n)}, see (vii) above or Corollary 13.21 

Consider first the case e(G — R) = and observe that then the maximum 
number of copies of G are obtained as soon as all edges from the roots 
appear, so, denoting this event by £ R , 

F(X§ > tfi) > F(£ R ) = p T(n ^ > e" C5n , 

which proves the lower bound in (b) in this case. (Since e(G—R) = implies 
pi = p2, (c) is trivial.) 

Thus, it remains to consider the case when po < p < P2 and e(G — R) > 0. 
We note first the trivial bound 

P{X§ > tfi) > P(G(n,p) = K n ) =pG) > e- C6n \ (3.15) 

Let Z be the number of edges in G{n — r,p). Since Z has binomial distribu- 
tion Bin(( n ~ r ),p) with mean P\^), it is easily seen that if (1 + 35)p < 1, 
and £s is the event {Z > (1 + 35)p{ n ^ 2 r )}, then 

> c 4 exp{-C 7 n 2 5 2 }. (3.16) 
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(The Chernoff bounds are essentially sharp, as is easily seen using Stirling's 
formula.) The number Xc-R{n — r,p) of copies of G — R in G(n — r,p) is a 
sum of N(K n ^ r , G — R) indicator variables I a . Conditioned on Z = z, each 
of them has the expectation 

p(/„ = i\z = z)= ''iff-* = ( ™i T ) ,< " R, (i - o(.-': 

U 2 ))e(G-R) M 2 

(3.17) 

Let iV<5 := (1 + 3(5)p( n 2 T ). If 8 > n 1 , z > N$ and n is large enough, then 
flSHZD yields 

P(J a = 1 | Z = z) > (1 + 3g)e(G-R) p <G-R) fa _ 0(n" 2 )) 
> (1 + 2% £ ( G -^. 

Consequently, if S > n~ l and n is large enough, then F(I a = 1 | £,5) > 
(1 + 25)p e ( G ~ R \ and summing over a we find 

E(X G „ R (n - r,p) I 5j) > (1 + 2^)EI G _ fi (n - r,p) = (1 + 2£)/x(G - i?). 

Hence, by Lemma 3.2 of [3], as in the proof of Lemma 3.3 therein, with 1/2 
replaced by we obtain 

F(* G _ fi > (1 + SHG - R) I ft) > ( T ^) 2 ^|^L_ > 

Assuming also the presence of all edges from the roots, i.e., the event £ R , 
we have X R = gX G -R (where g is as in the proof of the upper bound); 
further, by (|3.10p . n = gu(G — R)p eR ^; hence the inequality Xq-r > 
(1 + 5)u(G — R) is equivalent to 

> (1 + 5)p~ eR ^n. (3.18) 

Consequently, 

F(X R > (1 + 5)p~ eR ^n I £ R ,£ S ) > F(X G _ R > (1 + 5)a(G - R) \ £ 5 ) > c 5 <5 2 
and thus, by (|3.16p . 

F(X§ > (1 + 5)p~ eR{G) u) > c 5 8 2 F(£ s and £ R ) = c 5 5 2 F(£ s ) F{£ R ) 

> c 6 n~ 2 exp{-C 7 8 2 n 2 }p rn = exp{-9(6V + n)}, 

provided 1/n < 5 < |(p _1 — 1) and n is large enough. 

For po < p < pi, we choose 5 = n _1 ; then the right hand side of (13. 18ft is 
greater than p 1 a = t/j,, so we obtain 

F(X§ > tu) > exp{-9(n)}, 

which as remarked above is equivalent to the lower bound in (b) for this 
range of p. 
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Finally, if p\ < p < p% , we take 

5 := max{ip e « (G) - = max {(p/pi) efl(G) - 1,1/n} = 8(p-pi+l/n), 

so that the right hand side of (13.181) is again at least i/U. This yields the 
lower bound in (c) when n is large enough and p > p\ is small enough to 
guarantee that 5 < — 1)- For larger p, as well as for small n, we simply 

use (|3.15p . This completes the proof of the lower bound in (c). □ 



3.1. Examples and remarks. It is easy to see that the minimum defining 
M = Mr q in (|3.3|) is achieved by a subgraph H of G such that H — R is 
connected and, for every vertex v £ H, H contains all edges leading from v 
to R. These observations simplify computations of the bounds in Theorem 

EH 

Example 3.3. Cliques rooted at a vertex. Let G = K^, k > 2, and 

r = \R\ = 1. Then mn{G) = k/2 and eji(G) = k — 1. To find M, consider 
first the candidates H = Ki (with the root contained in H) and H = 



G = K^. For H = K2, we have, as shown in general in comment (v) 



above, i^^j) 1 ^ ^ ^ = np. For H = we have ^ K k = n k and 

a*(K k - R) = a*(K fe _i) = (k - l)/2, and thus J V«* = n 2 p k_ 
Hence, 

M < min{rap,n 2 p fe }; (3.19) 

we will show that equality holds. 

To this end, consider a general H C G with e(H — R) > and let 
F--H-R. Then e(if) < e(F) + v{F) and so, see (jO> . 

„e(F)-H>(F) 

> 



( np )a'(lT-Ji) - (np)«*{F) 

= (np^y^-^^p^)-^- 2 )^)- '*^) (3.20) 

and, dividing (13^201) by (np k - 1 ) a * iH ~ R \ 

> ^ npfc _i^(fO-2a*(F) p e(F)-(fc-2)(t»(fO-a*(F)) i (3 21 ) 



( n 2 p fc)a*(H-i?) 

Since ±v(F) < a*(F) < v(F), we have w(F) - a*(F) > while - 
2a*(F) < 0, so {np k - l ) v{F) - a * {F) > 1 if np*" 1 > 1 and {np k - l ) v{F) - 2a * (F) > 
1 if np k ~~ 1 < 1. Further, by [U Lemma 6.1], since F C G — R = iffc-i, we 
have e(F) < (/c — 2)(v(F) — a*(F)), and thus p^-MM^)-"*^)) > x for 
all p 6 (0,1]. Consequently, at least one of the right hand sides of (|3.20p 
and ([33T]) is > 1, so 

^>mm{(np) at( - H - R \(n 2 p k r^ H -^}, 
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or (*g)V«*(^--R) > mm {np,n 2 p k }. Finally, by ([33]) and (IXT91) . 

. j 2 n JnVi P < n~ 1/(fc_1) , 
M = mm{np,n p } = < -V(k-l) 

\np, p>n Vl* L >. 

Example 3.4. Bipartite graphs rooted at one whole side. These are 



exactly the graphs with e(G — R) = 0, and sopi = p2 (see comment (iii) after 
Theorem 13. ip . The two classes of the bipartition are R and S = V(G) \ R. 
Since the only connected subgraph of G — R is K\ , and a* (K\ ) = 1 , we have 
from (|3.3p and the comments above that M = np^ s<yG \ where Ag(G) := 
max^gg dc{v) is the maximum degree in G among all the vertices of S. 
Consequently, the upper bound in part (b) of Theorem 13.11 is of the form 

F(X§ > tu) < exp{-Q(np As ^)}. 

It follows from the above example that the bounds on F(Xq > tu) for K S 2 
with r = 2 and for even cycles C2 S with r = s are the same, since in both 
cases As(G) = 2 . This is a special case of a more general phenomenon that 
the bounds depend only on the structure of G — R and the degree sequence 
\Ng(v) (~l R\, v G V(G) \ R. Our next example provides one more instance 
of that. 

Example 3.5. Paths rooted at the endpoints and cycles rooted at 
a vertex. Let G = P k be a path with k vertices, k > 3, and let R be the 
set of its two endpoints. Then mn(G) = tEj}i and so p > n - 1 / m R( G ) implies 
that np — > oo as n — > oo. The minimum in M can be achieved only on a 
subpath H on at most k — 2 vertices containing one root, or H = P^. So, 

f ( x l/R/21 / ._, ._ A l/r(*-2)/2l1 

M = min < min n l p l , I n p ( 



KKfc-3 



The terms with even I are all equal to (np) 2 while for odd I they are equal 
to (np) 2l /( l+1 \ which means that the smallest among them is np, the term 
corresponding to a single rooted edge. Hence, for even k, M = np if p > 
71 -( fc -2)/ fc ) anc i otherwise M = n 2 p 2i - k ~ l ^( k ~ 2 \ the term corresponding to 
H = G. A similar cutoff for odd k occurs at n~( k ~ 3 '< with M taking 
the values of n 2 ( k ~ 2 ^ ( k ~^ p 2 and np, in turn. 

Finally, note that if R' is a single vertex in a cycle C^-i, k > 4, then 
m R ,{C k ^) = m R {P k ), y$ k i = y* h , a*(C fe _! - R') = a*(P k - R), and the 
same is true for all other candidates for the minimum in M, that is, paths 
with a root at one end. Thus, M R ' t c k _ 1 = ^R,P k an d the upper tail bounds 
provided by Theorem 13.11 are the same for these two rooted graphs. 

Remark 3.6. In the unrooted case, the lower tails are typically much 
smaller than the upper tails (see Remark 8.3 in [3]), and at best they can 
be of the same order of magnitude, e.g., whenp is fixed. Here, we encounter 
an opposite situation. Namely, for every (R, G) with en(G) > and a fixed 
p, by the FKG inequality, we have for any t > 1 

< tfi) > F(X§ = 0) > F(e R (G(n,p)) = 0) = exp{-9(n)} 
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while for t > 1 and p\ < p < p2, by Corollary 13.21 

F(X§ > tfi) = exp{-G(n 2 )}. 

Remark 3.7. If there are no isolated vertices in H — R and n > v(H), 
m > e(H), then (|3.7p may be improved to 

N R (n,m,H) = @(N(n,m, H - R)) = 0(iV(mm(n, 2m), m, H-R)). (3.22) 

Note, however, that this fails if H contains a vertex whose all neighbors 
are among the roots; for example if H is a rooted edge and n > m, then 
N R (n, m, H) = m and N(n, m, H — R) = n. 

For the lower bound in (|3,22p . take a graph Fq (with V(Fq)DR = 0) which 
achieves the maximum in N(n — r, m/3r, H — R); we may assume that Fq 
has no isolated vertices, and thus at most 2m/(3r) vertices. Then join all 
vertices of R to all vertices of Fq, obtaining a graph Fx which contains 
R, has at most n vertices, at most m/(3r) + r2m/(3r) < m edges, and 
is such that N R (Fx,H) > N(F ,H - R). Hence, N R (n,m,H) > N(n - 
r,m/(3r), H — R). Finally, provided m > 3re(H — R), we use the fact 
proved in [4] that if n' = 0(n), m' = Q(m) and n,n' > v(H), m,m' > e(H), 
then N(n',m',H) = Q(N(n,m, H)) (this follows directly from [4j Theorem 
1.3]). The case e{H) < m < 3re(H — R) is trivial, since then both sides of 
<^2\i are 6(1). 
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